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NMR- SOLVE METHOD FOR RAPID IDENTIFICATION 
OF BI-LIGAND DRUG CANDIDATES 

This application is a continuation-in-part of 
application Ser. No. 09/326,435, filed June 4, 1999, the 
content of which is hereby incorporated by reference. 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

The present invention relates to drug discovery 
methods, more specifically to NMR methods for identifying 
atoms of interest in enzyme ligands for generating and 
screening combinatorial libraries of bi-ligand drug 
candidates . 

BACKGROUND INFORMATION 

Widespread and sometimes indiscriminate use of 
antibiotics has allowed certain strains of disease- 
causing bacteria to become resistant to commonly 
available antibiotics. As a result, the need for more 
effective antimicrobial drugs is becoming more pressing. 
One approach to developing such drugs is to find 
compounds that bind to essential enzymes in bacteria. 
When such enzymes have two adjacent binding sites, it is 
especially useful to find "bi-ligand" drugs that can bind 
at both sites simultaneously. Such drugs are likely to 
bind extremely tightly, inactivating the enzyme and 
ultimately killing the bacteria. 
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The rapid discovery and development of bi- 
ligand drugs has been difficult. Bi-ligand drug 
candidates have been identified using rational drug 
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design, but previous methods are time-consuming and 
require a precise knowledge of structural features. When 
searching for a drug that binds to an enzyme at two 
binding sites, it would be particularly useful to 
5 understand how a ligand binds to the enzyme. 

Specifically, which atoms in the ligand interact with 
which portions of the enzyme's binding sites? 

Recent advances in nuclear magnetic 
10 spectroscopy (NMR) have allowed the determination of the 

three-dimensional interactions between a ligand and an 
enzyme in a few instances. However, these efforts have 
been limited by the size of the enzyme and can take years 
to map and analyze the complete structure of the 
15 complexes of enzyme and ligand. 

Thus, there is a need to more rapidly identify 
which atoms in the ligand interact with which portions of 
the enzyme binding sites so that focused combinatorial 
20 libraries can be generated and screened for more 

effective drugs. The present invention satisfies this 
need and provides related advantages as well. 

SUMMARY OF THE INVENTION 

25 

The present invention provides a method for 
rapidly identifying drug candidates that can bind to an 
enzyme having at least two binding sites. The first site 
on the enzyme is the "common ligand site" where a known 

30 ligand can bind to the enzyme, as well as to other 

related enzymes. The second site is a "specificity 
ligand site" adjacent to the common ligand site. Thus, 
the method identifies bi-ligand drug candidates that can 
bind at both the common ligand site and the specificity 

35 ligand site. As a result, the candidates can bind with 
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high affinity to the enzyme. As a further result,, the 
candidates can be used to bind to related enzymes sharing 
a similar common ligand site. 

5 The bi-ligand drug candidates are screened from 

a combinatorial library. Like other combinatorial 
libraries, a number of diverse compounds can be generated 
off of a core structure. In the case of a bi-ligand 
library, this core structure can be a mimic of the common 
10 ligand. The mimic can then be derivatized with varying 

groups at a selected point to generate the diversity of 
drug candidates in the library. The library is "focused" 
by optimizing the specific points on the mimic where 
variation occurs. 

15 

The optimal points of variation on the ligand 
are identified by determining which atoms are proximal to 
the specificity ligand site when the mimic is bound to 
the common ligand site. These atoms are identified by 

20 first determining which amino acids of the enzyme are 

proximal to the specificity ligand site, and then 
identifying which atoms on the bound common ligand mimic 
are proximal to these amino acids. NMR methods using the 
nuclear Overhauser effect (NOE) are particularly useful 

25 for identifying proximal atoms. Accordingly, this 

technique has been named Nuclear Magnetic Resonance- 
Structure Oriented Library Valency Engineering or NMR- 
SOLVE SM . As a result of NMR-SOLVE SM , the identified 
proximal atoms can then be used as a point for variation 

30 to generate a focused combinatorial library of high 

affinity drug candidates that can bind to both the common 
ligand site and the specificity ligand site of an enzyme 
of interest, as well as related enzymes sharing a similar 
common ligand site. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Figure la shows a schematic enzyme 1 having a 
specificity ligand site (SL site) la and a common ligand 
5 site (CL site) lb. For purposes of illustration, an 

interface region lc is shown between the SL site la and 
the CL site lb. Enzyme atom Id is within the interface 
region lc and enzyme atom le is outside the interface 
region lc. Figure lb shows a specificity ligand (SL) 2 
10 bound to the SL site la and a common ligand (CL) 3 bound 

to the CL site lb. The CL 3 has a reactive atom 3a 
within a reactive region 3b. A nonbound common ligand 
mimic (CL mimic) 4 is also shown, having individual atoms 
4a, 4b. 

15 

Figure 2 illustrates the first stage of 
identifying an atom Id of the interface region lc that is 
proximal to the reactive atom 3a of the CL 3 . In Figure 
2a, the reactive atom 3a of the CL 3 is perturbed. In 

20 turn, a nearby atom Id in the interface region lc of the 

enzyme 1 becomes perturbed, as shown in Figure 2b. Atom 
le, which is outside of the interface region lc, is more 
distant from the reactive atom 3a and is not detectably 
perturbed. Thus, detection of the perturbation of nearby 

25 atom Id allows its identification as an atom proximal to 

the reactive atom 3a of the CL 3. 

Figure 3 illustrates the second stage of 
identifying which CL mimics are proximal to the interface 

30 region lc and identifying an atom 4a of the CL mimic 4 

that is proximal to the part of the interface region lc 
identified as Id. In Figure 3a, the atom Id previously 
identified in the interface region lc is perturbed. An 
atom 4a in the CL mimic 4 then becomes perturbed, as 

35 shown in Figure 3b, but not atom 4b, which is more 



distant from interface atom Id. Consequently, an atom 4a 
of the CL mimic 4 is identified that is proximal to the 
part of the interface region lc identified as Id. As a 
further benefit, Figure 3c illustrates that the second 
5 stage can determine whether a particular CL mimic 5 binds 

to the CL site lb, but does not bind proximally to the 
interface region lc. Thus, if interface atom Id is 
perturbed, it will be too distant from atoms of the 
distally binding CL mimic 5 so that atom 5a does not 
10 become perturbed. 

Once a proximal atom 4a has been identified in 
the CL mimic 4, Figure 4a illustrates a focused 
combinatorial library 6 of bi-ligand drug candidates 6a, 

15 6b, 6c and 6d, having varying substituent groups attached 

to the identified proximal atom 4a. These are in 
contrast to drug candidates 7a to 7h from an unfocused 
combinatorial library 7 based on substitutions at other 
atoms such as 4b of the CL mimic 4 or substitutions to a 

20 distally binding CL mimic 5. Upon screening the focused 

library 6, a particular drug candidate 6c is selected for 
high binding affinity to the enzyme. As shown in Figure 
4b, the drug candidate 6c consists of the CL mimic 4 
attached to a SL-binding moiety 8 through a linker 9. 

25 The drug candidate 6c can then bind the enzyme 1 tightly 

at both the SL site la and the CL site lb. 

Figure 5 illustrates a variant method for 
identifying a proximal atom 4a. In Figure 5a, a CL 3 is 

30 bound to the CL site lb in the presence of nonbound CL 

mimic 4 . An atom 3a of the CL 3 is perturbed with 
radiof requency irradiation, transferring energy to a 
nearby atom Id of the interface region lc in an NOE 
experiment. The CL 3 then unbinds, as in Figure 5b. In 

35 Figure 5c, a CL mimic 4 then binds to the CL site lb, so 
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that the energy is then transferred from the atom Id of 
the interface region lc to a nearby atom 4a of the CL 
mimic 4, as shown in Figure 5d. As a result, the variant 
method allows identification of an atom 4a of the CL 
5 mimic 4 that is proximal to the interface region lc. 

Figure 6 illustrates the identification of 
amino acids at the interface region. Figure 6a shows the 
results of a 2D-HSQC NMR experiment with TROSY for an 

10 NADH-bound dehydrogenase. Figure 6b shows the results of 

a comparable experiment with an NADD-bound dehydrogenase, 
where the NADD has a deuterium-f or-hydrogen substitution 
on the 4-carbon position of the nicotinamide ring. As is 
well known, the 4-carbon of NADH is close to the 

15 interface region of dehydrogenases. In both figures, the 

x-axis represents the 1 H chemical shift and the y-axis 
represents the 15 N chemical shift. As a result of 
chemically perturbing the NADH to NADD, changes in the 
chemical shif ts— represented by the two arrows--permit 

20 identification of amino acids at the interface region of 

the enzyme. 

Figure 7 illustrates the identification of the 
proximal atom in the CL mimic using 3D-HSQC NOESY with 

25 TROSY. In Figure 7a, the horizontal axis represents the 

1 H chemical shift of protein, the vertical axis represents 
the 15 N chemical shift of protein and the oblique axis 
represents all 1 YL chemical shifts. The broken arrows in 
Figure 7a represent NOEs 10 resulting from radio 

30 frequency irradiation of the sample. These NOEs 10 are 

to ligand proton with chemical shift 11. The NOEs 10 
allow identification of a proximal atom 4a of a CL mimic, 
shown in Figure 7b. 
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Figures 8 to 13 illustrate the results 
discussed in Example II. 

Figure 8a shows the 2-D structure of the CL 
5 mimic designated TTE0001 . 002 . D2 . The positions of 

protons HI, H2 , H3, H4, Gl, G2 and G3 on the CL mimic are 
indicated. Figure 8b shows a Lineweaver-Burk plot for 
the compound in competition with NADH for lactate 
dehydrogenase . 

10 

Figure 9 shows the 2-D structures of the SL 
mimic 2 , 6-pyridinedicarboxylate (2,6-PDC) (Figure 9a), 
the common ligand NADH (Figure 9b) , and CL analog AcNADH 
(Figure 9c) , which is used for perturbation by chemical 
15 modification. AcNADH differs from NADH by a substitution 

from amine in NADH to methyl in AcNADH, shown at the far 
right of Figures 9b and 9c. Furthermore, individual 
protons H 1<A , H 8A and H 2N on AcNADH and corresponding atoms 
on NADH are indicated. 

20 

Figure 10 illustrates step (a) of the method of 
the invention, showing ^-"C HMQC spectra of threonine- 
labeled apo DHPR in the absence of ligand (Figure 10a), 
in the presence of common ligand NADH (Figure 10b) and in 
25 the presence of CL analog AcNADH. Peaks 2 and 12 are 

indicated with arrows. 

Figure 11 provides a chemical shift list of 
threonine residues in DHPR. The list separately provides 

30 the l R and 13 C chemical shifts for free DHPR (from 

Figure 10a), DHPR bound to common ligand NADH (from 
Figure 10b) and DHPR bound to CL analog AcNADH (from 
Figure 10c) . Cross-peaks were arbitrarily assigned 
numbers. The list then tabulates the changes in 13 C and 

35 X H shifts between free DHPR and DHPR bound to NADH (A5av 



NADH) , and then tabulates the changes between free DHPR 
and DHPR bound to AcNADH (A5av AcNADH) . The shifts were 
quantitated according to the starred equations shown. 
Shifts were normalized for proton (-^1.44) and carbon 
5 (h-33.7) chemical shift dispersion in the methyl region. 

The differences between these comparisons are further 
compared (A5av NADH versus A5av AcNADH) to identify 
potential atoms in the interface region. 

10 Figure 12 also illustrates step (a) of the 

invention, showing the 2D-NOESY spectrum of the binary 
complex of DHPR with CL analog AcNADH. The mixing time 
was 200 msec. Atoms of AcNADH are as defined in Figure 
9c. Chemical shift A corresponds to a methyl group on a 

15 threonine in a part of the CL site remote from the 

interface region that is close to atoms H 8A and H VA in 
AcNADH. In contrast, chemical shift B is for the atom 
corresponding to cross-peak #2 in Figure 10c, with a 
proton chemical shift of 1.5 ppm. This atom is close to 

20 proton H 2N in AcNADH, thereby confirming H 2N as an proton 

proximal to the interface region. 

Figure 13 illustrates step (b) of the method of 
the invention. The 2D NOESY spectrum is of the ternary 

25 complex of DHPR with the CL mimic TTE0001 . 002 . D2 (Figure 

8a) and an SL analog, 2,6-PDC (see Figure 9a). The SL 
analog was used to promote a closed conformation for the 
enzyme. The atoms of TTE0001 . 002 . D2 are as defined in 
Figure 8a. Cross-peak #2 corresponds to cross-peak #2 in 

30 Figure 10c. There is a slightly different proton 

chemical shift because SL mimic 2,6-PDC is bound, as well 
as because the CL mimic rather than AcNADH is bound. As 
a result, HI is identified as an atom on the CL mimic 
TTE0001 . 002 . D2 proximal the interface region, since HI is 

35 known to have a proton chemical shift of 7.0 ppm from 
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independent assignments using 2D COSY and ID proton 
spectra . 



DETAILED DESCRIPTION OF THE INVENTION 

5 

The present invention provides methods for 
generating focused combinatorial libraries and for 
screening them for drug candidates that can bind tightly 
to certain enzymes. The libraries are "focused" by 

10 selecting the best core structures and optimizing the 

location of variations applied to a given core structure, 
which serves as a platform for the particular library. 
Focusing the variations to a limited set of points on the 
core structure reduces the number of compounds to be 

15 generated and accelerates the drug discovery process. 

Using a core structure that binds to a common site shared 
by members of a gene family can also make the library 
generally useful for the other members. The variations 
in each library can be optimized by identifying specific 

20 atoms of interest in the core structure. For the 

purposes of the invention, the core structure is a 
ligand — or a chemical mimic to such a ligand — that binds 
to an enzyme. 

25 The term "enzyme" herein means any protein that 

catalyzes a biochemical reaction. Proteins having non- 
amino-acid modifications such as glycosylation or 
containing other non-proteinaceous components such as 
metal ion prosthetic groups are included within this 

30 definition. 

Enzymes useful in the invention are not limited 
by their size. For example, an enzyme can have a monomer 
molecular weight greater than 15, 25, 30, 35 or 45 
35 kD. An enzyme can also have a complete molecular weight 
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greater than 30, 50, 75, 100 or 150 kD. An enzyme can 
also have a monomer molecular weight less than 120, 100, 
80, 60 or 40 kD, or a complete molecular weight less than 
200, 150, 100 or 50 kD. The term "molecular weight" 
5 herein means the sum of the atomic weights of all the 

atoms in the molecule. The molecular weight can be 
estimated using well known techniques such as SDS-PAGE 
under reducing or non-reducing conditions. Molecular 
weights can also be determined using mass spectrometry, 
10 such as FAB, which includes cesium ion bombardment, 

electrospray ionization (ESI) and matrix-assisted laser 
desorption ionization (MALDI) . 

The enzyme can be obtained from any natural 
15 source such as human subjects, primates, mammals, 

animals, plants, fungi or bacteria, including any derived 
cell or tissue cultures. In particular, the enzyme can 
be from a human or animal pathogen. An enzyme can also 
be derived originally from a natural source, but later 
20 modified by artificial means, for example a recombinant 

enzyme or chemically modified enzyme. 

The enzyme can also be a member of a gene 
family. The term "gene family" herein means a group of 

25 genes — or proteins encoded by such genes — often derived 

by duplication and variation from a common ancestral 
gene, exhibiting sequence homology and related phenotypic 
functions. More specifically, a gene family is a family 
of proteins that can all bind a common ligand such as 

30 NADH or ATP. As such, gene family members can often be 

identified by the presence of a conserved amino acid 
sequence motif. Several large gene families have been 
identified, including families as large as 20, 50, 100 
and even 200 members. Two particular examples of a gene 

35 family are kinases and oxidoreductases . The term 



"kinase" herein means any enzyme that catalyzes the 
transfer of a phosphoryl group from ATP or other 
nucleoside triphosphate to another compound. The term 
"oxidoreductase" herein means any enzyme that catalyzes 
5 an oxidation-reduction reaction. Still other gene 

families include transaminases, decarboxylases and 
methyltransferases, as well as families of farnesyl-, 
geranyl-, geranyl-geranyl- and ubiquitin-transf erring 
enzymes . 

10 

Another particular gene family is the 
dehydrogenase gene family. The term "dehydrogenase" 
herein means any enzyme that catalyzes the removal of 
hydrogen from a substrate using a compound other than 

15 molecular oxygen as an acceptor. Typically the hydrogen 

is transferred to the coenzyme NAD + (nicotinamide adenine 
dinucleotide) or NADP + (nicotinamide adenine dinucleotide 
phosphate) . The dehydrogenase gene family is large, 
comprising approximately 17% of all enzymes (You, 

20 Kwan-sa, "Stereospecif icity for Nicotinamide Nucleotides 

in Enzymatic and Chemical Hydride Transfer Reactions," 
CRC Crit. Rev. Biochem. 17:313-451 (1985)). A 
particularly useful enzyme is dihydrodipicolinate 
reductase (DHPR) , which is 28 kD in monomer form, and is 

25 a tetramer in solution. Thus, the dehydrogenase family 

is likely to be a rich source of drug targets. 

The three-dimensional structures of several 
dehydrogenases are known, including dogfish lactate 
30 dehydrogenase, soluble porcine malate dehydrogenase, 

horse liver alcohol dehydrogenase, lobster 
glyceraldehyde-3-phosphate dehydrogenase and Bacillus 
stearothermophilus glycer aldehyde- 3 -phosphate 
dehydrogenase (apo form) . Based on these structures, it 
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is now understood that dehydrogenases share several 
conserved features, including two characteristic domains. 



The first domain conserved among the 
5 dehydrogenases contains a site that can bind NAD", NADP + 

or a similar hydrogen acceptor (Bellamacina, C.R., "The 
Nicotinamide Dinucleotide Binding Motif: A Comparison of 
Nucleotide Binding Proteins," FASEB J. 10:1257-1269 
(1996)). Because the NAD + ligand that can bind at such a 
10 site is shared in common with other members of the gene 

family, the NAD + -binding site can be described as a common 
ligand site. The term "common ligand site" herein means 
a location on any enzyme where a common ligand can bind. 

15 The term "common ligand" or "CL" herein means a 

molecule that can selectively bind at a site on an 
enzyme. A particularly useful CL can bind to a site 
conserved in a family of enzymes. The term can therefore 
extend to molecules that can bind to members of a gene 

20 family. 

A useful CL is a cof actor. The term "cof actor" 
herein means any small molecule that can bind in the CL 
site and participates in catalysis when bound to an 

25 enzyme. Cof actors often contain a nucleotide such as 

adenine mononucleotide or nicotinamide mononucleotide. 
Examples of such cofactors include ATP, ADP and SAM (S- 
adenosyl methionine) . Another group of cofactors that 
contain a nucleotide is the group NAD + , NADH, NADP + and 

30 NADPH. Other such cofactors include FMNH 2 , FMN, FAD, 

FADH 2 , CoA, GTP and GDP. Still other cofactors include 
THF, DHF, TPP, biotin, dihydropterin, heme, farnesyl and 
f arnesyl-pyrophosphate, geranyl, geranyl-geranyl , 
ubiguitin, pyridoxal phosphate and thiamine 

35 pyrophosphate. 
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CLs can have at least one atom that 
participates in the reaction mechanism catalyzed by the 
enzyme. In the case of NADH, this "reactive atom" is a 
hydrogen atom attached to the carbon-4 position of the 
5 nicotinamide ring, which is transferred as a hydride ion. 

For purposes of definition, a "reactive region" of the 
common ligand can be defined to encompass the reactive 
atom as well common ligand atoms that are immediately 
adjacent to the reactive atom. The reactive region also 

10 encompasses atoms of the common ligand that are 

immediately adjacent to the specificity ligand. In this 
definition, "immediately adjacent" means within 5 
Angstroms, but can be within 2, 3, 4 or 6, 7 or 8 
Angstroms. In functional terms, it can also mean 

15 sufficiently close that a chemical perturbation in one 

atom can be detected in a second atom, using NMR methods 
disclosed in greater detail below. Thus, a reactive 
region can mean the reactive atom itself as well as atoms 
of the common ligand that are immediately adjacent to the 

20 reactive atom or common ligand atoms immediately adjacent 

the SL. Since the reactive atom of a CL can react with 
an SL chemically, it is located at the interface of the 
CL and SL binding sites at some point in time. 

25 Although the common ligands can be naturally 

occurring cofactors or molecules, the term also extends 
to chemical analogs that serve as mimics of the common 
ligand. Thus, "common ligand" should be understood to 
encompass "common ligand mimics." 

30 

The term "ligand mimic" herein means a molecule 
that can bind to the enzyme at the same site as the 
ligand. Thus, a CL mimic can displace a CL bound to an 
enzyme at the CL site. This property can be demonstrated 
35 by enzyme-binding competition assays between a natural 
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common ligand and a common ligand mimic. The term can 
encompass molecules having portions similar to 
corresponding portions of the ligand in terms of 
structure or function. Because CLs can include CL mimics 
5 that are analogs of naturally occurring CLs, and because 

the scope of the term "CL mimic" can also encompass the 
original CL itself, the scope of the terms "CL" and "CL 
mimic" should be considered coextensive. 

10 Examples of mimics to the common ligand NADH, 

for example cibacron blue, are described in Dye-Liqand 
Chromatography , Amicon Corp., Lexington MA (1980). In 
the case of cibacron blue, the similarity with NADH may 
not be immediately obvious from schematic drawings of 

15 their chemical structures, and may only become apparent 

when their three-dimensional structures are compared. 
Thus, even when it is known that cibacron blue can bind 
at NADH-binding sites on an enzyme, it may be difficult 
to assign correspondences between particular atoms in the 

20 CL and the CL mimic. 

Numerous other examples of NADH-mimics, 
including useful modifications to obtain such mimics, are 
described in Everse et al. (eds.), The Pyridine 

25 Nucleotide Coenzymes , Academic Press, New York NY (1982) . 

Particular analogs include nicotinamide 2-aminopurine 
dinucleotide, nicotinamide 8-azidoadenine dinucleot ide , 
nicotinamide 1-deazapurine dinucleotide, 3-aminopyridine 
adenine dinucleotide, 3-diazoacetylpyridine adenine 

30 dinucleotide and 5-aminonicotinamide adenine 

dinucleotide. Particular CL mimics can be identified and 
selected by ligand-displacement assays, as is well known 
in the art. CL mimic candidates can also be identified 
by searching databases of compounds for structural 

35 similarity with the common ligand or a CL mimic. 
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An example of a naturally occurring CL is NADH 
(Figure 9b) . Another CL is AcNADH (Figure 9c) , which is 
an analog of NADH. An example of a CL mimic is shown in 
Figure 8a. When a CL or CL mimic has a structural or 
functional similarity to a known naturally occurring CL, 
such as between NADH and AcNADH, a reactive atom of a CL 
can be identified by analogy to the reactive atom of a 
known naturally occurring CL. 

Unlike the first domain conserved among the 
dehydrogenases, the second domain can differ markedly 
among the individual enzymes and can bind a unique 
substrate from which the hydrogen is removed. 

The term "specificity ligand" or "SL" herein 
means a molecule that can bind to a second site on the 
enzyme. A particularly useful SL can bind to a unique or 
a relatively small subset of a gene family, where the 
family shares binding properties with a single CL. 
Accordingly, the term "specificity ligand site" is the 
location where the specificity ligand can bind to an 
enzyme . 

While specificity ligands can be naturally 
occurring molecules, they can also be chemical analogs 
that serve as mimics to naturally occurring molecules. 
These can be referred to as SL mimics, although 
"specificity ligand" should be understood to encompass 
"specificity ligand mimics" as well. Because SLs can 
include SL mimics that are analogs of naturally occurring 
SLs, and because the scope of the term "SL mimic" can 
also encompass the original SL itself, the scope of the 
terms "SL" and "SL mimic" should be considered 
coextensive. Specificity ligands can also have a 
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reactive atom and reactive region, corresponding to the 
reactive atom and reactive region defined for CLs above. 

In summary, such enzymes can transfer a 
5 reactive atom between a SL bound at the SL site and a CL 

bound at the CL site. For example, an NAD + -dependent 
dehydrogenase can transfer a hydride ion from the SL to 
the CL, which is NAD + . 

Given the catalytic function of dehydrogenases 
and other enzymes, it follows that the CL site and the SL 
site should be physically located in proximity in the 
enzyme's three-dimensional structure to facilitate 
transfer of the hydrogen or other reactive atom. In 
fact, the three-dimensional geometric relationship 
between the CL site and SL sites has been shown to be 
conserved in evolutionarily related dehydrogenases (Sem 
and Kasper, "Geometric Relationship Between the 
Nicotinamide and Isoalloxazine Rings in NADPH-Cytochrome 
P450 Oxidoreductase, " Biochemistry 31:3391-3398 (1992)). 
Thus, the CL site and SL site are said to be "adjacent." 
The domains of a protein may move relative to each other, 
depending on a variety of factors, such as substrate 
binding, so the protein can be described as being in an 
open or closed conformation. Nevertheless, in practical 
terms, an atom of the CL site and an atom of the SL site 
are at some point in time at most 3, 4, 5, 6, or 7 A 
apart . 

30 The portion of the enzyme between the adjacent 

CL and SL sites can be defined as the "interface region." 
In Figure la, the interface region is shown schematically 
as 1c. Because an SL bound at the SL site can have 
similar properties as enzyme atoms of the interface 

35 region, the term "interface region" can also encompass 
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20 



atoms of an SL if an SL is bound to the enzyme at the SL 
site . 



While the enzymes have been discussed in 
reference to dehydrogenases for purposes of illustration, 
it should be understood that any other enzyme will be 
useful in the invention so long as (1) the enyzme can 
bind a CL or a CL mimic at a CL site and binds an SL at 
an adjacent SL site; (2) an interface region can be 
defined as the atoms of the enzyme between the CL site 
and SL site, including atoms of a SL if bound to the 
enzyme; (3) the enzyme catalyzes a reaction mechanism 
involving a SL and a reactive atom of a CL; and (4) a CL 
reactive region can be defined as the reactive atom of 
the CL and CL atoms immediately adjacent to the reactive 
atoms or CL atoms immediately adjacent to the SL. 

To prepare an enzyme for use in the method of 
the invention, the enzyme can be isolated or expressed by 
recombinant methods, harvested and purified by any 
conventional method well known in the art. A 
particularly useful enzyme can be isotopically labeled 
with 2 H, 13 C, 15 N or any combination of these isotopes 
(Venters et al., "High-level 2 H/ 13 C/ 15 N Labeling of 
Proteins for NMR Studies," J. Biomo l. NMR 5:339-344 
(1995)). Following such labeling, deuterated amides can 
be exchanged with protons, depending on the particular 
NMR method used, as discussed below. A high level of 
amino-acid-specif ic protonation can be re-introduced in 
the protein. For example, to introduce protonation, 
this can be achieved by using directly protonated amino 
acids, or amino-acid precursors when the protein is 
expressed in bacteria (Rosen et al . , "Selective Methyl 
Group Protonation of Perdeuterated Proteins," J . Mol . 
Biol. 263:627-636 (1996)). The amino acids that are 
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particularly useful contain methyl groups, although any 
amino acids can be introduced. 



Having disclosed enzymes and ligands useful in 
5 the invention, the present invention provides a method 

for identifying an atom of a common ligand mimic that is 
proximal to an interface region. The method comprises 
two separate stages. In the first stage, an atom of the 
interface region is identified that is proximal to the 

10 reactive region of the common ligand. Second, the atom 

identified in the first stage is used to identify an atom 
in the CL mimic that is proximal to the interface region. 
Because this atom will also be proximal to the SL site, 
the atom or immediately adjacent atoms can then serve as 

15 an optimal point of variation when generating a 

combinatorial library of bi-ligand compounds that can 
bind at both the CL and the SL sites. 

The first stage is performed by binding a CL to 
20 the CL site of the enzyme. The SL may also be bound to 

the enzyme. Then, an atom of the CL reactive region is 
perturbed by any of the various methods disclosed in 
further detail below. Thus, an interface atom can be 
identified that is perturbed by the perturbation of the 
25 CL reactive region, and hence is proximal to the CL. 

In the second stage, a CL mimic is bound to the 
CL site. Again, the SL may be bound to the enzyme. The 
interface atom identified during the first stage is then 
30 perturbed. As in the first stage, this perturbation then 

causes further perturbation in nearby atoms in the CL 
mimic, which can also be detected, thereby identifying an 
atom of the CL mimic that is proximal to the interface 
region . 

35 
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As a comparison, atoms of the CL mimic that are 
not proximal to the interface atom do not become 
perturbed during the second stage. Similarly, if a CL 
mimic is bound to the CL site so that its atoms are not 
5 proximal to the interface region, the CL mimic atoms will 

also not become perturbed. Thus, only proximal atoms of 
the CL mimic are identified. 

In addition to the general method for 
10 identifying proximal atoms disclosed above, several 

variations are also encompassed within the invention. 
These variant methods also provide the identity of atoms 
in the common ligand or CL mimic that are proximal to the 
interface region of the enzyme, and therefore also 
15 proximal to the SL site. 

Where an atom of the interface region has 
already been identified in the enzyme, the method can 
proceed directly to the second stage. Such 

20 identification can be by any means available, such as by 

homology modeling, mutagenesis or limited assignment. 
Thus, the method can comprise the steps of (1) binding a 
CL mimic to the CL site; (2) perturbing the identified 
atom of the interface region; and (3) identifying an NMR 

25 cross-peak corresponding to an atom of the CL mimic that 

is perturbed by the perturbation of the interface atom, 
thereby identifying an atom of the CL mimic that is 
proximal to the interface region. 

30 Another variation makes use of transient 

binding of CL and CL mimics at the CL site of the enzyme 
in a transferred NOE experiment. As shown in Figure 5a, 
a CL is bound to the CL site in the presence of unbound 
CL mimic. An atom of the CL can be perturbed by 

35 radiof requency irradiation in an NOE experiment so that 
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energy is transferred from the reactive region CL atom to 
the interface region. The CL is then allowed to unbind, 
as shown in Figure 5b. When a CL mimic binds at the same 
CL site, as shown in Figure 5c, magnetization is 
5 transferred back from the interface region to perturb an 

atom in the CL mimic, as shown in Figure 5d. As a 
result, an NMR cross-peak can be identified corresponding 
to the atom of the CL mimic proximal to the interface 
region . 

10 

Yet another variation involves the observation 
of NOEs on dissociated ligands. An interface atom of the 
enzyme is perturbed, for example by irradiation, followed 
by magnetization transfer to CL mimic bound to the 
15 enzyme. Upon dissociation of the CL mimic from the 

enzyme, the NOE can then be observed before magnetization 
decay in the nonbound CL mimic. Thus, an atom of the 
dissociated CL mimic can be identified that was proximal 
to the interface region when bound to the CL site. 

20 

The term "perturb" herein means to affect the 
chemical shift, intensity or lineshape of the NMR signal 
for a nucleus so that the effect can be detected in an 
NMR experiment. Perturbation also includes causing a 
25 detectable change in nuclear spin relaxation rates (1/T2 

and 1/T1) . 

A particular perturbation method is to 
chemically alter an atom or an immediately adjacent atom. 

30 An example is the chemical substitution of reactive 

region atoms, such as replacing hydrogen with deuterium. 
Another substitution is replacing an amide with a 
carboxylic acid. These substitutions alter the 
electronic and structural environment of nearby atoms in 

35 the interface region, which produces changes in their NMR 
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chemical shifts. Other chemical alterations include the 
introduction of paramagnetic or quadrupolar nuclei. 



For example, a reactive hydrogen in NADH or 
5 NADPH can be perturbed by replacing the hydrogen with 

deuterium, resulting in NADD or NADPD (Sem and Kasper, 
"Geometric Relationship between the Nicotinamide and 
Isoalloxazine Rings in NADPH-Cytochrome P-450 
Oxidoreductase: Implications for the Classification of 

10 Evolutionarily and Functionally Related Flavoproteins , " 

Biochemistry 31: 3391-3398 (1992)). In this sense, the 
perturbation from NADH to NADD can be considered to be 
either the chemical alteration itself or the particular 
differences in the NMR spectra of an enzyme binding NADH 

15 and the NMR spectra of an enzyme binding NADD. 

Similarly, an atom of the enzyme can be perturbed by 
site-directed mutagenesis so that the differences between 
the spectra of the original enzyme and the mutated enzyme 
can be detected. 

20 

Yet another method of perturbation is to 
irradiate an atom with radio frequency energy (Rf 
irradiation) . The term "radio frequency energy" herein 
means oscillating electrical voltages, currents or 

25 electromagnetic fields with frequencies in the range of 

10 to 1000 MHZ or 10 7 to 10 9 sec' 1 . The frequency selected 
depends on the magnetic field strength and the 
corresponding Larmor frequency for the nucleus of 
interest at that field strength. Thus, differences in 

30 NMR spectra can be observed in samples with and without 

irradiation . 



35 



In modern multidimensional NMR NOE experiments, 
excitation is performed at a range of frequencies 
simultaneously so that frequencies are read off axes in 



each dimension. The central frequency and frequency 
range is selected for a given nucleus type so that it 
will excite all of those nuclei in the molecule. For 
example, at an 11.7 Tesla magnetic field strength, 
5 protons are excited at 500.0 MHZ (the Larmor frequency), 

with a range of at least + 5000 Hz. 

As a result of perturbing a particular atom, 
other nearby atoms can be perturbed as well. The nuclear 

10 Overhauser effect (NOE) can cause detectible changes in 

the NMR signal of an atom that is proximal to the 
perturbed atom. Here, NOE is meant to include steady 
state and transient NOE. The signal changes are the 
result of magnetization transfer to the proximal atoms. 

15 Since an NOE occurs by spatial proximity, not merely 

connection via chemical bonds, it is especially useful 
for determining distances between molecules. The term 
"proximal" herein means within a defined distance of one 
or more atoms of interest, where the defined distance is 

20 a function of the method used to perturb. Functionally, 

"proximal" can be defined as being within a distance 
where perturbation can be detected. When an NOE is used, 
the distance is usually 5 Angstroms, but can be 2, 3, 4 
or 6, 7 or 8 Angstroms. 

25 

Perturbation--whether by chemical alteration of 
an atom or as a result of irradiation in an NOE 
experiment — can be detected and identified by a variety 
of known methods. Detectable changes in NMR signals 

30 include changes in intensity (NOE) , location (chemical 

shift) or width (linewidth) . General NMR techniques for 
proteins, including multidimensional NMR experiments and 
determination of protein-ligand interactions can be found 
in David G. Reid (ed. ) , Protein NMR Techniques , Humana 

35 Press, Totowa NJ (1997) . 
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In practice, the perturbed atoms in large 
molecules can be identified using a multidimensional 
multinuclear NMR method to identify NMR cross-peaks 
corresponding to the perturbed atoms. Heteronuclear NMR 
5 experiments are particularly useful with larger proteins 

as described in Cavanaugh et al . , Protein NMR 
Spectroscopy: Principles and Practice , ch . 7, Academic 
Press, San Diego CA (1996) . For example, two-dimensional 
NMR experiments can measure the chemical shifts of two 

10 types of nuclei. A well established 2-D method is the 1 H- 

13 N heteronuclear single quantum coherence (HSQC) 
experiment. Another method is the heteronuclear multiple 
quantum coherence (HMQC) experiment. Numerous other 
variant experiments and modifications are known in the 

15 art including nuclear Overhauser enhancement spectroscopy 

experiments (NOESY) , for example NOE experiments 
involving a { 1 H , ^-H } NOESY step. 

Ambiguity based on chemical shift overlap can 
20 be resolved by (a) introducing a third dimension for 15 N 

or 13 C chemical shift; (b) not decoupling to a heteroatom 
in one of the two dimensions, thus producing diagnostic 
1 H- 13 C or 1 H- 15 N one bond coupling constants; or (c) using 
2-D "C-iH or 15 N- 1 H HMQC or HSQC-^H^H} NOESY variants. 

25 

Higher-dimensional NMR experiments can be used 
to measure the chemical shifts of additional types of 
nuclei and to eliminate problems with cross peak overlap 
if spectra are too crowded. In particular, the NMR 

30 method used can correlate 1 H, 13 C and 15 N (Kay et al . , J. 

Magn. Reson. 89:496-514 (1990); Grzesiek and Bax, J. 
Magn. Reson. 96:432-440(1992)), for example in an HNCA 
experiment. Other heteronuclear NMR experiments can be 
used so long as the transfer of magnetization to all CL 

35 and protein protons is only to or from amide protons on 



the protein, since all carbon-attached protons in the 
protein are replaced with deuterons . Such experiments 
include HNCO, HN(CO)CA, HN (CA) CO and HNCACB experiments. 

5 Other experiments involve cross-peaks of NH 

protons of protein at Asn, Gin, Arg and/or His. 
Similarly, the cross-peaks of the methyl protons of 
protein specifically 13 C- 1 H 3 labeled at Leu, Thr, lie, Val, 
Ala and/or Met can be observed in NOESY, chemical shift 
10 perturbation or other experiments. 

Particular multidimensional techniques for 
identifying compounds that can bind to target molecules 
are described in U.S. Patents No. 5,698,401 to Fesik et 

15 al., and No. 5,804,390 to Fesik et al. Related 

publications include PCT publications WO 97/18469, WO 
97/18471 and WO 98/48264. However, these techniques, 
sometimes described as "SAR by NMR, " require the complete 
assignments and determination of the three-dimensional 

20 structure of the enzyme (Shuker et al . , "Discovering 

High-Affinity Ligands for Proteins: SAR by NMR," Science 
274:1531-1534 (1996); Hajduk et al., "Discovery of Potent 
Nonpeptide Inhibitors of Stromelysin using SAR by NMR, " 
J. Am. Chem. Soc. 119:5818-5827 (1997)). As a result, 

25 identification of suitable ligands using SAR by NMR can 

be an undertaking of more than a year. Only by 
determining the multiple structures of the enzyme while 
complexed with different pairs of ligands can these 
techniques suggest ligands to be covalently joined and 

30 tested for binding affinity. In contrast, the method of 

the invention does not require determining the complete 
assignments or structure of the enzyme; instead, it 
rapidly provides sufficient information to generate a 
focused combinatorial library of bi-ligand inhibitors 

35 without providing excess information. 
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Recent advances in NMR spectroscopy have used 
transverse relaxation-optimized spectroscopy (TROSY) to 
achieve narrow line widths, substantially increasing 
resolution and sensitivity of multidimensional NMR 
5 experiments. As a result, NMR can be applied to larger 

molecules than previously possible, such as proteins up 
to 100 kD or larger (Pervushin et al., J. Am. Chem. Soc. 
120:6394-6400 (1998); Salzman et al., "TROSY-type Triple- 
Resonance Experiments for Sequential NMR Assignments of 

10 Large Proteins, J. Am. Chem. Soc. 121:844-848 (1999)). 

Deuterium labeling and decoupling has also been used to 
achieve narrow line widths, further enabling NMR methods 
to be applied to large proteins (Yamazaki et al., "a 
Suite of Triple Resonance NMR Experiments for the 

15 Backbone Assignments of 15 N, 13 C, 2 H Labeled Proteins with 

High Sensitivity," J. Am. Chem. Soc. 116:11655-11666 
(1994) ) . 

Deuterium labeling is especially useful if all 
20 protons— except those on specific methyls, such as 

threonines — are replaced with deuterons. Such methyl 
protons should be readily detectable, even on large 
proteins . 

25 Using the NMR methods disclosed above, atoms 

proximal to perturbed atoms can be identified in the 
first stage by comparing NMR spectra with and without 
perturbation. As exemplified in Figure 6, for example, 
NMR spectra from a 2D-HSQC experiment with TROSY can be 

30 compared for NADH-bound dehydrogenase and NADD-bound 

dehydrogenase. By comparing the spectra, NMR cross peaks 
can be identified that correspond to atoms most affected 
by the change in electronic and chemical environment 
resulting from substituting deuterium for hydrogen. As a 

35 result, the atoms in the NADH-binding site closest to the 
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SL binding site, which are in the interface region, can 
be identified. 



In order to know which cross peaks in the 
5 uncomplexed dehydrogenase correspond to the perturbed 

cross peaks in the NADD-dehydrogenase complex, it may be 
necessary to titrate the uncomplexed dehydrogenase with 
increasing concentrations of either NADH or NADD and 
monitor progressive changes in chemical shifts. Such a 
10 titration is not necessary if the corresponding cross 

peaks in the uncomplexed dehydrogenase can be identified 
by other means, such as the presence of a unique pair of 
13 C chemical shifts in the HNCA experiment. 

15 In the second stage, NMR spectra can be 

collected for the enzyme in complex with a CL mimic. For 
example, a TROSY implementation of the 3D [ 1 H- 15 N] HSQC- 
NOESY is illustrated in Figure 7. The third dimension, 
labeled "all proton chemical shifts," identifies protons- 

20 -especially on the CL mimic — that receive an NOE-type 

perturbation from the protons identified in the first 
stage. The identity of these NOE-perturbed protons on 
the CL mimic are easily established based on chemical 
shift values, since all protons on the CL mimic can be 

25 assigned by standard methods well known in the art (P.L. 

Rinalidi, Two Dimensional NMR Spectroscopy: Applications 
for Chemists and Biochemists , pages 841-872 (Croasmun and 
Carlson, eds . ) , VCH Pubs, New York (1994)). Isotope 
filtering can also be used in this third dimension to 

30 select only those protons not attached to 15 N. These NOE- 

perturbed protons on the CL mimic and other atoms in this 
region represent the optimal positions for variation in 
constructing a bi-ligand library. Significantly, the 
optimal position has been identified without having 
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previously determined the complete assignments or 
structure for the enzyme. 



It may be necessary to identify which cross 
5 peaks in the enzyme-CL mimic complex correspond to the 

interface region protons identified in the first stage. 
This can be achieved by titrating the CL mimic onto the 
dehydrogenase and monitoring how the cross peak changes 
for the previously identified proton. Again, such 
10 experiments are not necessary if the cross peaks of 

interest can be identified by other means. 

In summary, to facilitate the catalytic 
mechanism of an enzyme, an atom of the reactive region of 

15 a bound common ligand will be proximal to the SL site and 

the intervening interface region. The reactive region 
can then be used to identify proximal atoms in the 
interface region. In turn, by virtue of the proximity 
between the interface region and the CL site, the 

20 ' interface region can be used to identify proximal atoms 

in a bound common ligand mimic. Such proximal atoms will 
be nearest to the SL site and provide a basis for 
building bi-ligands that can bind to both the CL site and 
the SL site. As a corollary, the method can identify, by 

25 elimination, those atoms in the CL mimic that are too 

distant from the SL site to serve as useful points of 
variation . 

In an alternate embodiment, the first stage can 
30 be performed by binding an SL to the SL site of the 

enzyme; perturbing an atom of the SL corresponding to an 
SL reactive region, and identifying an NMR cross-peak 
corresponding to an atom that is perturbed by the 
perturbation of the atom of the SL reactive region, 
35 thereby identifying an atom of the interface region. The 



second stage can be performed by binding an SL mimic to 
the SL site, perturbing the interface atom previously 
identified, and identifying an NMR cross-peak 
corresponding to an atom of the SL mimic that is 
5 perturbed by the perturbation of the interface atom, 

thereby identifying an atom of the SL mimic that is 
proximal to the interface region. 

Thus, the following is a generalized embodiment 

10 of the invention for identifying an atom of a first 

ligand mimic that is proximal to an interface region. A 
given enzyme can bind a first ligand (LI) or a first 
ligand mimic (LI mimic) at a first ligand site (LI site) 
and can bind a second ligand (L2) at an adjacent second 

15 ligand site (L2 site) . The interface region can be 

defined as the atoms of the enzyme between the Ll site 
and L2 site, but also includes atoms of L2 if bound to 
the enzyme. An Ll reactive region is defined as the 
reactive atom of Ll, as well as Ll atoms immediately 

20 adjacent to the reactive atom or Ll atoms immediately 

adjacent to L2 . The first stage involves identifying an 
atom of the interface region, by binding an Ll to the Ll 
site of the enzyme, perturbing an atom of the Ll reactive 
region; and identifying an NMR cross-peak corresponding 

25 to an atom that is perturbed by the perturbation of the 

atom of the Ll reactive region, thereby identifying an 
atom of the interface region. The second stage involves 
identifying an atom in the Ll mimic that is proximal to 
the interface region, by binding an Ll mimic to the Ll 

30 site; perturbing the interface atom previously identified 

in the first stage; and identifying an NMR cross-peak 
corresponding to an atom of the Ll mimic that is 
perturbed by the perturbation of the interface atom, 
thereby identifying an atom of the Ll mimic that is 

35 proximal to the interface region. 



29 

It should be noted that CL mimics may bind to 
different positions within a CL site. For example, a 
particular CL mimic 4 may bind relatively close to the 
interface region, as shown in Figure 3a, while another CL 
5 mimic 5 may bind distally, as shown in Figure 3c. By 

identifying atoms proximal to the interface region, the 
method can therefore be used to screen for CL mimics that 
can bind proximally to the interface region. CL mimics 
like 5 that can bind distally may not contain atoms 
10 sufficiently close to the interface region to be useful 

for generating bi-ligand libraries. 

Because the method uses NMR methods to provide 
structural information to engineer a bi-ligand library, 

15 the method can be termed Nuclear Magnetic Resonance- 

Structure Oriented Library Valency Engineering or NMR- 
SOLVE SM . Without NMR— SOLVE SM , a combinatorial library 
based on the CL mimic would involve variation at 
potentially every point on the CL mimic. With NMR- 

20 SOLVE SM , the library can be focused at the optimal point 

of variation, representing a significant savings in drug 
discovery time. 

Thus, the present invention also provides a 
25 method for generating a focused combinatorial library of 

bi-ligand compounds that can simultaneously bind to a CL 
site and an SL site of an enzyme. The term 
"combinatorial library" herein means an intentionally 
created set of differing molecules prepared by taking a 
30 base structure and, in parallel reactions, adding 

different substituent groups to points on the base 
structure, resulting in the parallel synthesis of 
compounds that are variations on the core structure. By 
taking the products as core structures in a succeeding 
35 set of parallel reactions, further variant compounds can 



be generated, resulting in a diversity of related 
compounds. As a result of the combinatorial process, the 
products are generally prepared in essentially equimolar 
quantities, considering of course the different 
5 efficiencies of the individual synthetic reactions. Not 

included within this definition are multiple isomeric and 
chiral products and undesired by-products resulting from 
a single reaction scheme. Also not included are 
intentional or accidental mixtures of originally pure 
10 compounds not arising out of the combinatorial synthetic 

process . 

A number of formats for generating 
combinatorial libraries are well known in the art, for 
15 example soluble libraries, compounds attached to resin 

beads, silica chips or other solid supports. As an 
example, the "split resin approach" may be used, as 
described in U.S. Patent No. 5,010,175 to Rutter and in 
Gallop et al., J. Med. Chem. , 37:1233-1251 (1994). 

20 

The term "substituent group" herein means any 
chemical compound or functional group that can be 
synthetically attached to a base structure. Examples of 
substituent groups suitable for addition to a base 

25 structure include halo, hydroxy and protected hydroxyls, 

cyano, nitro, C x to C 6 alkyls, C 2 to C 7 alkenyls, C 2 to C 7 
alkynyls, C x to C 6 substituted alkyls, C 2 to C 7 substituted 
alkenyls, C 2 to C 7 substituted alkynyls, C 1 to C 7 alkoxys, 
C 1 to C 7 acyloxys, to C 7 acyls, C 3 to C 7 cycloalkyls, C 3 

30 to C 7 substituted cycloalkyls, C 5 to C 7 cycloalkenyls , C 5 

to C 7 substituted cycloalkenyls, a heterocyclic ring, C 7 
to C 12 phenylalkyls, C 7 to C 12 substituted phenylalkyls, 
phenyl and substituted phenyls, naphthyl and substituted 
naphthyls, cyclic C 2 to C 7 alkylenes, substituted cyclic 

35 C 2 to C 7 alkylenes, cyclic C 2 to C 7 heteroalkylenes, 
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substituted cyclic C 2 to C 7 heteroalkylenes, carboxyl and 
protected carboxyls, hydroxymethyl and protected 
hydroxymethyls, amino and protected aminos, 
(monosubstituted) amino and protected 
5 (monosubstituted) aminos, (disubstituted) aminos, 

carboxamide and protected carboxamides, C x to C 4 
alkylthios, C x to C 4 alkylsulf onyls , C x to C 4 
alkylsulf oxides, phenylthio and substituted phenylthios, 
phenylsulfoxide and substituted phenylsulf oxides or 
10 phenylsulf onyl and substituted phenylsulf onyls . As 

discussed below, substituent groups can also include 
compounds that are ligands to enzymes such as a SL or SL 
mimics, as well as linkers. 

15 Under the method of the invention, the 

libraries are generated by first performing the methods 
disclosed above to identify an optimal CL mimic and a CL 
mimic atom that is proximal to the interface region. In 
Figure 4, for example, the proximal atom 4a has been 

20 identified. Then, compounds are synthesized by modifying 

the CL mimic at the proximal atom by attaching 
substituent groups. 

For example, a focused combinatorial library 6 
25 has been generated by using the proximal atom 4a as a 

point of variation on the CL mimic core structure 4 . The 
library contains several compounds 6a-6d, each having a 
substituent group added at the proximal atom 4a. Thus, 
the term "focused" herein means a combinatorial library 
30 where substituent groups are added to preselected points 

on the core structure. In contrast, an unfocused 
combinatorial library 7 could begin with a non-optimal CL 
mimic 5 or begin with the same CL mimic 4 as a core 
structure, but the addition of substituent groups could 
35 occur at any point on the base structure, for example at 
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an atom 4b distant from the SL site, leading to a 
substantially greater number of potentially ineffective 
bi-ligand compounds to be screened. 

5 Because the invention is directed to 

combinatorial libraries of bi-ligand compounds, a 
particularly useful substituent group includes a chemical 
moiety that can bind to the SL site of the enzyme. The 
selection of such a moiety depends on the enzyme of 
10 interest, of course, but the substituent group can be an 

SL or SL mimic of an enzyme attached to the core 
structure of the CL mimic, allowing the resulting 
compound to bind to both the SL and CL sites of the 
enzyme . 

15 

In some enzymes, the SL and CL sites can be a 
short distance apart, for example 2 A. Nevertheless, 
when seeking a compound having moieties that can bind at 
both the SL and CL sites, it can be useful to incorporate 

20 a linker into the substituent group to span the distance 

between the two sites. The term "linker" herein means 
any chemical group or portion of a molecule used to 
physically connect one moiety to another moiety. For 
example in Figure 4b, a linker 9 can connect an SL-site- 

25 binding moiety 8 to a CL mimic core structure 4 while 

maintaining the two moieties at a distance, allowing the 
compound 6c to bind the enzyme 1 tightly at both the SL 
site la and the CL site lb. Thus, a linker can also 
provide positioning and orientation of the two moieties 

30 to optimize their binding to their respective binding 

sites . 

Because the resulting library of compounds can 
bind to both sites of the enzyme, they can be described 
35 as "bi-ligands . " The term "bi-ligand" herein means any 
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molecule having at least two moieties where either moiety 
can bind to an enzyme independently or both moieties can 
bind to the same enzyme simultaneously. In such a bi- 
ligand, one moiety can be a CL or CL mimic. Similarly, 
the other moiety can be a SL or SL mimic. Thus, the 
invention also provides a combinatorial library of bi- 
ligand compounds obtained by the method of the invention 
for generating focused libraries. Such a library can 
contain at least 2, 5, 10, 15, 20 or 50 bi-ligand 
compounds. It can also contain 100, 200, 500, 1000 or 
even up to 10,000 or 100,000 bi-ligand compounds. 

As a result of their ability to bind at 
multiple sites on an enzyme, the bi-ligand compounds in 
the libraries can have high affinities to the enzyme. 
Even if two individual moieties have relatively low 
binding affinities, the combination of the two ligands in 
a single bi-ligand compound can have a synergistically 
higher binding affinity (Radzicka & Wolfenden, 
"Transition State and Multisubstrate Inhibitors," Methods 
in Enzvmology 249:284-303 (1995)). As an example, where 
two weakly binding ligands having binding affinities of 
17 mM and 0.02 mM were linked, the resulting compound had 
a much higher binding affinity of 15 nM (Hajduk et al . , 
"Discovery of Potent Nonpeptide Inhibitors of Stromelysin 
using SAR by NMR, " J. Am. Chem. Soc. 119:5818-5827 
(1997)). Accordingly, such bi-ligand compounds can be 
screened for their affinity to the enzyme or to other 
enzymes in the gene family. 

Another advantage of these bi-ligand libraries 
is that the CL mimic can provide a certain baseline 
affinity for other members of the gene family. Further 
addition of a specificity ligand or SL mimic can provide 
additional affinity to other particular members. Thus, 
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the library can be a rich source of specific bi-ligand 
inhibitors for multiple members of a gene family. 



Thus, the present invention further provides a 
5 method for screening libraries of bi-ligand compounds. 

After generating a combinatorial library of bi-ligand 
compounds as disclosed above, the binding of the 
compounds to the enzyme is measured. Individual 
compounds are then identified having greater binding than 
10 the CL mimic. Binding assays for enzymes and ligands are 

well known in the art and can be selected based on the 
particular enzyme and ligands being used. 

Individual bi-ligand compounds identified by 

15 this screening method are also encompassed within the 

invention. In particular, such compounds can be screened 
that increase or decrease the activity of the enzyme. 
Moreover, such compounds can have extremely high binding 
affinities, having binding affinities of 100, 200, 1000, 

20 5000 or even 10,000 times greater than the CL mimic's 

binding affinity. In addition to screening the bi-ligand 
compounds for affinity to the enzyme, it can be screened 
as an inhibitor for other members of the gene family. 
Binding to particular enzymes can also be particularly 

25 high when compared to other members of the gene family, 

so that the compound's binding affinity can be at least 
50, 100, 200, 500, or 1000 times greater to the 
individual enzyme than to another enzyme in the same gene 
family. This specificity is provided by the binding 

30 interactions with the specificity ligands or SL mimics, 

since the common ligand can bind with similar affinity to 
multiple members of gene family. 
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Once high affinity bi-ligand compounds are 
identified, their binding to the enzyme at both CL and SL 
sites may be verified by the NMR methods disclosed above. 

EXAMPLE I : NMR- SOLVE SM of DHPR 
from Mycobacterium tuberculosis 

The following example illustrates the NMR- 
SOLVE SM method for identifying an optimal CL mimic and an 
atom of a CL mimic that is proximal to the interface 
region of an enzyme. 

The enzyme dihydrodipicolinate reductase (DHPR) 
from Mycobacterium tuberculosis is selected because it 
plays a key role in the synthesis of the cell wall, and 
is vital for the survival of Mycobacterium (Pavelka and 
Jacobs, "Biosynthesis of Diaminopimelate , the Precursor 
of Lysine and a Component of Peptidoglycan, is an 
Essential Function of Mycobacterium smegmatis," 
Bacteriol. 178:6496-6507 (1996)). 

Tuberculosis is a desirable biological target 
for drug design, since the genome has recently been made 
available, providing a rich source of new drug targets 
(Cole et al., "Deciphering the Biology of Mycobacterium 
tuberculosis from the Complete Genome Sequence," Nature 
393:537-544 (1998)). Moreover, tuberculosis is the 
leading cause of death from infectious disease in adults 
(Dolin et al . , "Global Tuberculosis Incidence and 
Mortality During 1990-2000," Bull. WHO 72:213-220 
(1994)}. Furthermore, tuberculosis is showing a 
resurgence in developing nations with increasing reports 
of drug resistant strains (Snider et al . , "Global Burden 
of Tuberculosis," pages 3-11, in B.R. Bloom (ed.), 
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Tuberculosis: Pathogenesis, Protection, and Control , ASM 
Press, Wash. D.C. (1994)). 

5 A. Preparation of uniformly 15 N- and 13 C-labeled DHPR, 

with deuterium in non-exchangeable positions 

Genomic DNA from Mycobacterium tuberculosis is 
prepared with standard methods (Sambrook et al., 

10 Molecular Cloning: A Laboratory Manual , Cold Spring 

Harbor Laboratory,. Cold Spring Harbor NY (2nd ed. 1989) ) . 
The DHPR gene is cloned using oligonucleotide primers 
complementary to the ends of the gene from the open 
reading frame sequence (Pavelka et al., "Cloning of the 

15 dapB Gene, Encoding Dihydrodipicolinate Reductase, from 

Mycobacterium tuberculosis, " J. Bacteriol. 179:2777-2782 
(1997) (Accession number U66101) ) . Standard PCR reactions 
are used with the GENEAMP PCR kit from Perkin Elmer 
(Norwalk CN) . 

20 

The resulting fragment is cloned into both the 
pGEX (Pharmacia: Piscataway NJ) and pET41 (Novagen: 
Madison WI) vectors, using manufacturer's instructions, 
with appropriate restriction enzymes for the respective 

25 vectors. The plasmid/gene constructs are then 

transformed into commercially available competent E. coli 
BL21 (DE3) pLysS strains (Invitrogen: Carlsbad CA) , 
according to the manufacturer's instructions. The 
following expression is performed on a trial basis with 

30 each vector to empirically select for optimal expression 

yields . 

E. coli containing the expression construct is 
adapted to grow in 100% D 2 0 using standard methods and 
35 glycerol stocks are prepared (Venters et al., "High-level 

2 H/ 13 C/ 15 N Labeling of Proteins for NMR Studies," J\. 
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Biomol. NMR 5:339-344 (1995)). Expression is performed 
in defined media of 3 g/L sodium [1,2- 13 C2, 99%] acetate 
or 2 g/L [U- 13 C6, 99%] D-glucose as the sole carbon source 
and 1 g/L [ 15 N, 99%] ammonium chloride or [ 15 N, 99%] 
ammonium sulfate as the sole nitrogen source. All stable 
isotopes are from Cambridge Isotopes, Inc. (Andover MA) 
and Isotec, Inc. (Miamisburg OH) . The media also 
contains a modified M9 salt mixture: 44 raM Na 2 HP0 4 , 22 mM 
KH 2 P0 4 , 9 mM NaCl, 1 mM MgS0 4 , 0.1 CaCl 2 , 0.02 /uM FeCl 3 
and basal medium Eagle vitamin mix (Gibco-BRL) , with the 
appropriate antibiotic for the expression vector, such as 
50 /xg/mL carbenecillin ) . All reagents are prepared in 
D 2 0. 

This media is inoculated at 37 °C from either a 
glycerol stock or from a freshly plated colony. When the 
absorbance at 600 nm reaches 0.4-0.8, DHPR production is 
induced by adding isopropyl-|3-D-thiogalactopyranoside to 
a final concentration of 500 ju-M and incubated for 12 to 
18 hours. Cells are harvested by centrifugation at 5000 
rpm for 20 min at 5°C, then lysed with sonication. 

The protein is purified on a glutathione 
affinity column using the manufacturer' s instructions, 
which includes removal of the glutathione-S-transf erase 
enzyme with thrombin cleavage. The protein is also 
purified with FPLC anion- or cation-exchange 
chromatography using standard procedures (Pharmacia: 
Piscataway NJ) . To perform experiments relying on 1 H- 15 N 
correlations, the solvent-exposed deuterated amide 
positions are exchanged with protons by incubating the 
protein at pH 7.5 at room temperature for 2 to 24 hours, 
then exchanging into a lower pH buffer (such as 20 mM 
phosphate buffer at pH 6.0) at 5°C using a PD10 gel 
filtration column (Pharmacia) or CENTRICON 10 filter 
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(Millipore: Bedford MA). Although not all amides are 
exchanged, the solvent-exposed amides do exchange, and 
the unfilled active sites are sufficiently exposed to 
allow exchange. Avoiding complete exchange has the 
5 advantage of simplifying the 3D NMR spectra. Protein for 

NMR experiments is present at 0.5-5 mM, although lower 
concentrations can be used with a cryoprobe (Bruker: 
Billerica MA) . 

10 B. Identification, of CL mimics 

The common ligand of DHPR is NADH or NAD + . CL 
mimics are identified by displacement experiments. Such 
displacement is measured using standard steady-state 

15 kinetic analysis where the concentration of NADH is 

varied near its concentration in the presence of a 
single, fixed concentration of the other substrate, 
dihydrodipicolinate {Reddy et al., "Expression, 
Purification, and Characterization of Eschericia coli 

20 Dihydrodipicolinate Reductase," Biochemistry 34:3492-3501 

(1995); Reddy et al . , "Interaction of Pyridine Nucleotide 
Substrates with Eschericia coli Dihydrodipicolinate 
Reductase: Thermodynamic and Structural Analysis of 
Binary Complexes," Biochemistry 35:13294-13302 (1996)). 

25 This is repeated in the presence of several 

concentrations of a potential CL mimic. If 
Lineweaver-Burk plots (1/velocity versus 1/[NADH]) are 
prepared at the different concentrations of CL mimic 
inhibitor, according to the equation for a competitive 

30 inhibitor, an intersecting pattern will be obtained if 

the CL mimic binds in the NADH site (Cleland, W.W., 
Methods Enzvmol. 63:103-138 (1979)). 

It is also possible to identify CL mimics in a 
35 displacement assay that is generally useful for multiple 



members of a gene family. A f luorescently labeled CL 
mimic is displaced by a candidate CL mimic and the 
displacement is detected with fluorescence polarization 
(Burke et al . , Phage Display of Peptides and Proteins , 
5 Academic Press, New York NY (1996)). 

The above methods disclose how CL mimic 
candidates are experimentally validated as binding at the 
CL site. Molecules to be screened in such an assay can be 

10 synthesized or purchased from chemical supply houses. 

Commercially available molecules are preferably chosen 
according to some pre-selection criteria, such as by 
identifying those molecules present in a database of 
commercially available molecules (Asinex: Moscow, Russia) 

15 that have similar shape and electronic properties to a 

target structure such as the known common ligand (NADH) 
or a structurally characterized CL mimic (such as 
cibacron blue) . 

20 THREEDOM software was used to search for 

matches against a target structure. THREEDOM employs a 
gnomonic projection algorithm for shape-matching a target 
structure in an INTERCHEM format 3D structure coordinate 
file against a database of multiple structure/coordinates 

25 (Doucet and Weber, Computer-Aided Molecular Design: 

Theory and Applications , Academic Press, San Diego CA 
(1996) ) . The target used to search the ASINEX database 
was cibacron blue. Coordinates were obtained from the 
published 3D structure (Li et al., "The Three-dimensional 

30 Structure of NAD ( P) H : quinone Reductase, a Flavoprotein 

Involved in Cancer Chemoprotect ion and Chemotherapy: 
Mechanism of the Two-electron Reduction," Proc. Natl. 
Acad. Sci. USA 92:8846-8850 (1995)). The database was 
created by converting an SD format file of structures 

35 from ASINEX to INTERCHEM format coordinates using the 
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batch2to3 program. The target is compared against each 
structure in the database in multiple orientations to 
generate a matching score. Out of 37,926 structures 
searched, the 750 best matching scores were selected. 
5 From these 750 structures, 250 are selected and purchased 

based on objective criteria such as likely favorable 
binding interactions, pharmacophore properties, synthetic 
accessibility and likely pharmacokinetic, toxicological, 
adsorption and metabolic properties. 

10 

C. NMR-based Identification of Protein Interface Atoms 



NMR experiments are performed on a Bruker 700 
MHZ NMR spectrometer equipped with a 1 H/ 13 C/ 15 N triple 

15 resonance probe able to simultaneously perform broadband 

2 H decoupling and 2 H locking. Triple resonance 
experiments are performed, and data are processed with 
standard methodology (Cavanagh et al., Protein NMR 
Spectroscopy: Principles and Practice , Academic Press, 

20 New York (1996) ) . 

A 1 mM solution of labeled ( 2 H, 13 C and 15 N) DHPR 
is prepared in 95% H 2 0/5% D 2 0, pH 6.0 and kept at 5°C. 
All of the NMR experiments are performed in the presence 

25 of a stoichiometric excess of 2 , 6-pyridinedicarboxylate . 

The compound 2 , 6-pyridinedicarboxylate is an analog of 
dihydrodipicolinate, the other DHPR substrate. The 
compound is present in excess to produce a dead-end 
complex that forces the enzyme into a closed conformation 

30 so that SL-binding and CL-binding domains are proximal 

(Scapin et al., "Three-Dimensional Structure of 
Eschericia coli Dihydrodipicolinate Reductase in Complex 
with NADH and the Inhibitor 2 , 6-Pyridinedicarboxylate , " 
Biochemistry 36:15081-15088 (1997); Wang et al., 

35 "Hydrogen/Electrospray Ionization Mass Spectrometry 
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Studies of Substrate and Inhibitor Binding and 
Conformation Changes of Escherichia coli 

Dihydrodipicolinate Reductase, Biochemistry 36:3755-3759 
(1997) ) . 

5 

3D HNCO and 3D HNCA data are collected for the 
binary complexes of enzyme bound with 

2 , 6-pyridinedicarboxylate, and for ternary complexes 
further binding NADH or NADD. In addition, 3D HNCA 

10 spectra are collected at multiple concentrations of NADH 

to identify those cross peaks in the binary complex 
corresponding to those in the ternary complex that are 
most affected by the H to D substitution between NADH and 
NADD. NADD [4R position labeled] is prepared 

15 enzymatically from NAD + and perdeutero ethanol using 

alcohol dehydrogenase (Sem and Kasper, "Geometric 
Relationship between the Nicotinamide and Isoalloxazine 
Rings in NADPH-Cytochrome P-450 Oxidoreductase : 
Implications for the Classification of Evolutionarily and 

20 Functionally Related Flavoproteins , " Biochemistry 31: 

3391-3398 (1992) ) . 

The experiments performed with NADD are 
repeated with the other NADH analog, nicotinic acid 

25 adenine dinucleotide (reduced) , which has the carboxamide 

group replaced with a carboxylic acid. The reduced form 
is prepared enzymatically from the oxidized form of 
nicotinic acid adenine dinucleotide using alcohol 
dehydrogenase and unlabeled ethanol, as disclosed above. 

30 Although multiple NAD(P) (H) analogs are available for 

these perturbation studies, those chosen provide adequate 
chemical shift perturbations for this study (Everse et 
al. The Pyridine Nucleotide Coenzymes , Academic Press, 
New York NY (1982) ) . Because the changes are made to the 

35 portion of the NADH molecule known to be closest to the 
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SL site, the cross peaks that shift as a result of the 
chemical changes to NADH are therefore determined to be 
at the interface region between the two binding sites. 

5 HNCO and HNCA experiments (Kay et al . , J. Magn. 

Reson. 89:496-514 (1990); Grzesiek and Bax, J. Magn. 
Reson. 96:432-440(1992)) were performed as the TROSY 
implementation to produce narrower line-widths for the 
15 N- 1 H correlations (Salzmann et al., "TROSY in Triple 

10 Resonance Experiments: New Perspectives for Sequential 

NMR Assignment of Large Proteins," Proc. Natl. Acad. Sci. 
USA 95:13585-13590 (1998); Salzman et al . , "TROSY-type 
Triple-Resonance Experiments for Sequential NMR 
Assignments of Large Proteins, J. Am. Chem. Soc. 121:844- 

15 848 (1999) ) . Deuterium labeling and decoupling is also 

included to produce narrower line widths, particularly 
for 13 C resonances (Yamazaki et al., "A Suite of Triple 
Resonance NMR Experiments for the Backbone Assignments of 
15 N, 13 C, 2 H Labeled Proteins with High Sensitivity," 

20 Am. Chem. Soc. 116:11655-11666 (1994)). Spectra are 

processed with both Felix (MSI: San Diego CA) and NMRPipe 
software (NIH: Bethesda, MD) (Delaglio et al . , "NMRPipe: 
a Multidimensional Spectral Processing System Based on 
UNIX Pipes" J. Biomol. NMR 6:277-293 (1995)). 

25 

D. NMR-based Identification of Proximal CL mimics and 
Proximal Atoms on CL mimics 

NMR experiments are performed as in the first 
30 paragraph of section C. 3D HNCA data are collected at 

multiple concentrations of each CL mimic identified in 
section B as a competitive inhibitor against NADH. At 
the highest concentration of the CL mimic, two types of 
NOESY experiments are performed. The first is a TROSY 
35 implementation of the 3D [ 1 H- 15 N] HSQC-NOESY. Since there 

is significant cross-peak overlap in the 3D experiment, 
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4D versions of the HNCA and HNCO experiments are also 
performed (Kay et al . , "Four-Dimensional Heteronuclear 
Triple-Resonance NMR Spectroscopy of Interleukin-lb/ in 
Solution," Science 249: 411-414 (1990)). The 4th 
5 dimension is for NOEs from the 15 N-attached protons to any 

other proton. The NOEs of interest are those from the 
perturbed cross-peaks identified in section C to any 
protons on the CL mimic being studied. These protons 
therefore represent optimal positions on the CL mimic for 
10 library expansion by attaching linkers. 

E . Linker Attachment and Validation 

Once a linker is synthetically attached to the 
15 CL mimic according to atoms of interest identified in 

section D, NMR experiments are performed on the DHPR 
ternary complex with 2 , 6-pyridinedicarboxylate and the 
modified CL mimic. NMR experiments are performed as in 
the first paragraph of section C. 3D HNCA data are 
20 collected at multiple concentrations of the modified CL 

mimic (with linker attached) , as disclosed in section D. 
These cross-peak perturbations are compared with those 
observed in previous (section D) experiments with the 
original unmodified CL mimic. From these data it is 
25 established that the modified CL mimic contacts the same 

binding site atoms in both complexes. Additional 
cross-peak perturbations due to attachment of a linker 
should also correspond to some of the interface atoms 
identified in section C. 

30 

In addition to NMR experiments, steady-state 
inhibition experiments are performed as disclosed in 
section B to establish that the modified CL mimic is 
still a competitive inhibitor against NADH, and to 
35 determine the dissociation constant with DHPR to 
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establish that adding the linker does not significantly 
disrupt the strength of the binding interactions with 
DHPR. 

5 EXAMPLE II: NMR-SOLVE SM of DHPR from Escherichia coli 

Given the similarity of DHPR protein sequences 
in Mycobacterium tuberculosis and Escherichia coli, and 
the higher levels of expression in E. coli, DHPR from E. 
10 coli was used in the following example. 

The common ligand for DHPR was NADH (Figure 9b) 
and the specificity ligand was dihydrodipicolinate . An 
analog of NADH, AcNADH (Figure 9c) was used for chemical 
15 perturbation experiments. Similarly, an analog of 

specificity ligand dihydrodipicolinate was used: SL 
analog 2 , 6-pyridinedicarboxylate (2,6-PDC) (Figure 9a). 

A. Preparation of uniformly 15 N-labeled DHPR, with 
20 deuterium in non- exchangeable positions, except 

threonines, which were 13 CH 3 - labeled 

1. Expression and isotopic enrichment of DHPR 

25 DHPR was prepared that was uniformly enriched 

in 2 H and 15 N and containing W^N/^C-labeled threonine 
residues. 2 mL LB media with 100 mg/mL carbenicillin was 
inoculated aseptically with 20 mL of a glycerol stock of 
E. coli containing the pETlla+/DHPR expression construct 

30 (Reddy et al., supra) . This culture was grown to OD 600 

0.4-0.5 at 37°C. Cells were then conditioned to grow on 
deuterated media. The first step was a 50-fold dilution 
of the LB culture into 2mL of 90% D 2 0 from Cambridge 
Isotope Laboratories or Isotec, Inc. in minimal media 

35 containing 5 g/L D-glucose, 2 g/L NH 4 C1, 10.725 g/L 
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Na 2 HP0 4 -H 2 O, 4.5 g/L KH 2 P0 4 , 0.75 g/L NaCl, 2mM MgS0 4 and 
2mL (Ix) of the following trace metal and nutrient 
solution: 2 mg/mL CaCl 2 , 2 mg/mL ZnS0 4 -7H 2 O, 15 mg/mL 
thiamine, 10 mg/mL niacinamide, 1 mg/mL biotin, 1 mg/mL 
5 choline chloride, 1 mg/mL pantotenic acid, 1 mg/mL 

pyridoxine, 1 mg/mL folic acid, 10.8 mg/mL FeCl 3 -6H 2 0, 0.7 
mg/mL Na 2 Mo0 4 • 2H 2 0, 0.8 mg/mL CuS0 4 -5H 2 0 and 0.2 mg/mL 
H3BO3. 

10 This culture was grown to OD 600 0.3-0.4 at 37°C 

and then diluted 40-fold into 5 mL of 100% D 2 0 M9 minimal 
media, which was identical to the media above except the 
D-glucose was uniformly 2 H-enriched (Martek Biosciences 
Corp. : Columbia MD) and the ammonium chloride was 

15 uniformly 15 N-enriched (Martek) . This 5mL culture was 

grown overnight and then used to inoculate lOOmL of the 
same media. The lOOmL culture was grown at 37 °C in a 1L 
baffled shaker flask for 16 hours to a final OD 600 of 
4.5-5.0. 

20 

Protein expression was then carried out on a 1L 
scale in a B10FL0W 3000 fermentor (New England Biolabs: 
Beverly MA) . The pH of the culture was monitored via a 
gel-filled pH probe and maintained at pH 7.0 by the 
25 automated feeding of 0 . IN NaOD. The culture was aerated 

by continuous sparging of dried air at a flow rate of 5 
L/minute. The temperature was maintained at 37 °C via a 
recirculating chiller, and the dissolved oxygen level was 
monitored via a D.O. probe. 

30 

The lOOmL shaker flask culture was used to 
inoculate 1L basal fermentation media containing 2 gm/L 
2 H-D-glucose, 0.8 g/L 15 NH 4 C1 and 0 . 5x of the trace metal 
and vitamin mix. The culture was grown until the pH feed 
35 was inactive and the dissolved oxygen level began to 
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rise, at which time a batch-feed solution was added, 
consisting of 3 gra/L 2 H-D-glucose , 1.2 gm/L 15 NH 4 C1, 0.5x 
of the trace metal and vitamin mix and lOOmg U- 2 H/ 15 N/ 13 C- 
labeled threonine (Cambridge) . After a re-equilibration 
5 period of 10 to 15 minutes, protein expression was 

induced by adding 2mM IPTG. The induction phase was 
carried out until the pH feed was inactive and the pH 
value began to rise with final cell densities ranging 
from OD 600 4-5 . The cells were then pelleted at 5000xg, 
10 4°C for 10 minutes in four 250 mL fractions and frozen at 

-80°C. 

2 . Purification 

15 The cell pellet from one of the fractions was 

resuspended in 250mL ice-cold lysis buffer consisting of 
50mM Tris pH 7.5, lOOmM NaCl, ImM EDTA, and lmL protease 
inhibitor cocktail. The resuspended cell pellet was 
homogenized for 3 minutes. The homogenized, resuspended 

20 cells were then lysed by three passes at 18,000 PSI 

through a microf luidizer . The soluble fraction was 
separated from the insoluble cellular debris by 
centrifugation at 20,000xg, 4°C for 45 minutes. The 
supernatant was then dialyzed against 2L of 50mM Tris pH 

25 7.8, ImM EDTA for 4 hours and subsequently cleared by 

centrifugation at 20,000xg, 4°C for 45 minutes. 

The cleared, dialyzed soluble cellular fraction 
was then loaded on a lOOmL FAST FLOW Q-SEPHAROSE 

30 quaternary ammonium ion exchange column (Pharmacia) , that 

had been equilibrated in 25mM Tris pH 7.8, ImM EDTA. The 
column was washed extensively with equilibration buffer, 
and the protein eluted in 5mL fractions with a 1L 0-1 M 
NaCl gradient at a flow rate of 1 mL/minute. Eluted 

35 fractions were analyzed by SDS-PAGE. Fractions 



containing significant levels of DHPR were pooled and 
loaded in two fractions onto 50mL FAST FLOW BLUE 
SEPHAROSE 6 dye-affinity column (Pharmacia) , which had 
been equilibrated in 2 0mM Tris pH 7.8, ImM EDTA. The 
column was washed with 2 column volumes of equilibration 
buffer and the protein was eluted with equilibration 
buffer containing 2M NaCl . Approximately 200 mg/L of 
>99% pure DHPR was obtained, representing a yield of 
about 150mg/L. 

The protein was 13 CH 3 -labeled at threonine 
positions so that NMR-observable atoms were present on 
the protein in addition to amides. Thus, NMR-SOLVE 
experiments could be performed that involved either 
backbone amides in 1 H- 15 N correlation spectra or at methyl 
groups of threonines in 1 H- 13 C correlation spectra. 
Narrow line widths were obtained in 1 H- 15 N correlation 
spectra by using the TROSY pulse sequence (Salzman et al. 
(1998) supra; Salzman et al. (1999) supra; Wider and 
Wuethrich, "NMR spectroscopy of large molecules and 
multimolecular assemblies in solution" Current Opinion in 
Structural Biology 9:594-601 (1999)). In^-^C 
correlation spectra, narrow line widths were obtained 
because of the fast rotation of methyl protons. Methyl 
protons have the added advantage of increased sensitivity 
because of the presence of three equivalent protons. 
Specific labeling of residues, such as threonines, has 
the advantage of simplifying 2D-NMR spectra. 

B. Identification of CL mimics 

CL mimics for DHPR include NADH, NAD, NADPH or 
NADP. Mimics of NADH or NAD were obtained by screening 
for displacement of the natural common ligand with the 
enzyme lactate dehydrogenase (LDH) . Such displacement 



was measured using standard steady-state kinetic 
analysis, where NADH was varied around its K m in the 
presence of a single fixed concentration of the other 
substrate, pyruvate (Zewe and Fromm, Biochem istry 4:782- 
792 (1965)). This was repeated in the presence of 
several concentrations of a potential CL mimic. 

When Lineweaver-Burk plots (1/rate versus 
1/ [NADH] ) were drawn at the different concentrations of 
CL mimic inhibitor, an intersecting pattern would be 
obtained if the CL mimic was binding in the NADH site, 
according to the equation for a competitive inhibitor 
(Cleland, supra) . It was also possible to identify CL 
mimics in a generally useful (across a gene family) 
displacement assay: a f luorescently labeled CL mimic was 
displaced by a candidate CL mimic. The displacement was 
detected with fluorescence polarization. This can be 
performed in the same manner as with peptides (Burke et 
al., supra), but with CL mimics. 

The method above describes how CL mimic 
candidates were experimentally validated for binding in 
the CL site. Molecules to be screened in such assays 
were synthesized or purchased from chemical supply 
houses. The candidates were pre-selected by 
computationally identifying molecules present in a 
database of commercially available molecules (ASINEX: 
Moscow, Russia) that had similar shape and electronic 
properties to either the known common ligand, NADH, or a 
structurally characterized CL mimic, such as cibacron 
blue . 



THREEDOM software (Interprobe Chemical 
Services: Glasgow, Scotland) was used to perform this 
search for matches against a target structure. THREEDOM 



employed a gnomonic projection algorithm (Doucet and 
Weber supra) for shape-matching a target structure in an 
Interchem-format 3D-structure coordinate file (Interprobe 
Chemical Services: Glasgow, Scotland) against a database 
of multiple structures/coordinates. The target used to 
search the ASINEX database was cibacron blue. 
Coordinates were obtained from the published 3D structure 
(Li et al. supra) . The database was created by 
converting an SD format file of structures from ASINEX to 
Interchem format coordinates using the batch2to3 program 
(Interprobe Chemical Services) . 

The target was compared against each structure 
in the database in multiple orientations, and a matching 
score was generated for the optimal orientation and 
overlay. Out of 74,500 structures searched, the 1244 
with best matching scores were selected and purchased. 
These compounds were screened in the above-mentioned 
steady-state inhibition assay with LDH. One of the 
inhibitors identified, designated as TTE0001 . 002 . D2, is 
shown in Figure 8a. 

The inhibition profile for that compound is 
shown in Figure 8b. The Lineweaver-Burk plot for the 
compound with lactate dehydrogenase was plotted showing 
1 / [NADH] {fJ-M)' 1 on the horizontal axis versus 1/rate 
(min/Abs) on the vertical axis. Reactions were carried 
out in 0.1M HEPES buffer pH 7.4, 2.5 mM pyruvate and 5 
/^g/mL LDH, with DMSO kept fixed at 3%. NADH was varied 
at 5.4, 7.0, 10.1, and 2 0.1 /uM, and the compound was 
present at 0 (solid circles), 6.9 (open circles), 13.7 
(closed triangles) and 20.6 /jM (open triangles). Initial 
rates were measured by monitoring absorbance changes at 
34 0 nm. The competitive inhibition pattern indicated 
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that it bound in the NADH (common ligand) site with 
dissociation constant of 2.1 + 0.4 fM. 



C. NMR-based identification of protein interface atoms 

5 

NMR experiments were performed on a 7 00 MHZ NMR 
spectrometer (Bruker NMR: Fremont CA) equipped with a 
1 H/ 13 C/ 15 N triple resonance probe able to simultaneously 
perform broadband 2 H decoupling and 2 H locking. Multiple 

10 resonance experiments were performed. Data were 

processed with standard methodology (Cavanagh et al. 
supra) . Most experiments were performed with about 1 mM 
DHPR samples at about pH 7.5, 25 mM Tris-d n buffer in 99% 
D 2 0 or in 95% H 2 0/ 5% D 2 0, with sample at 30°C. "C-^ 

15 correlation experiments were obtained via HMQC 

magnetization transfer because it was more sensitive than 
HSQC for the particular samples. Adiabatic WURST 13 C P 
decoupling was applied during the 13 C tl evolution time. 
A 5ms WURST-10 adiabatic pulse shifted at 70 ppm was 

20 applied to decouple l3 C Y from 13 C|3 . Typically, 100 * 2048 

complex points were acquired with 16 to 32 scans per 
increment, with a total time between about 1 to 2h per 
HMQC experiment. Typical 2D [^H/H] NOESY were acquired 
with 256 * 2048 complex points and with mixing times 

25 between 100 ms and 500 ms . 13 C Y decoupling during tl 

evolution was achieved with a 13 C 180 degree refocusing 
pulse. 13 C decoupling during the acquisition was achieved 
with a GARP composite decoupling sequence. The measuring 
time for a 2D ^H/H] NOESY varied from about 12 to 48 h. 

30 

Ligand characterization and structure 
elucidation was accomplished with standard ID 1 H and ID 
13 C NMR, as well as 2D ^H/H] COSY and, in some cases, 2D 
[ 1 H, 1 H] ROESY. Frequently, NMR experiments were performed 
35 in the presence of 3 mM 2, 6-pyridinedicarboxylate (2,6- 
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PDC) , to produce a dead-end complex that would force the 
enzyme into a closed conformation (Scapin et al. supra; 
Wang et al. supra). 2 , 6-pyridinedicarboxylate (Figure 
9a) was an analog of the other substrate, 
dihydrodipicolinate . Substrate and cof actor domains of 
dehydrogenases most frequently adopt a closed 
conformation where both domains are proximal and when 
both ligands are bound. 

2D 1 H- 13 C HSQC data were collected for the 
enzyme without ligand bound (Figure 10a), and for the 
binary complex with NADH (Figures 9b and 10b) or the 
binary complex with 3-acetylpyridine adenine dinucleotide 
(AcNADH) (Figures 9c and 10c) . In addition, spectra were 
collected at multiple concentrations of NADH or AcNADH to 
identify cross peaks in the apo enzyme corresponding to 
the binary complex that were most affected by the amine- 
to-methyl substitution in going from NADH (Figures 9b and 
10b) to AcNADH (Figures 9c and 10c) . 

Although multiple NAD ( P) (H) analogs are 
available for these perturbation studies (Everse et al . 
supra) , AcNADH provided adequate chemical shift 
perturbations for this study. The protein cross peaks 
that shifted as a result of the chemical changes to the 
NADH molecule were at the interface of the two binding 
sites, since the changes were made to a part of the NADH 
molecule known to be proximal to the specific binding 
site. Two protein cross-peaks were perturbed by the 
amine (Figure 9b) to methyl (Figure 9c) substitution on 
NADH, shown by comparing Figures 10b and 10c. Those 
effects were quantified in Figure 11. 

In addition to chemically perturbing the common 
ligand NADH and looking for an effect on protein, protons 
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on AcNADH were also perturbed by irradiation, as in a 
standard 2D NOESY experiment (Figure 12), and looking for 
effects on the protein in the form of an NOE, as 
indicated by a cross-peak between the AcNADH protons and 
protons on the protein corresponding to cross-peak #2 in 
Figures 9c and 10c. 

In this case, because of the specific labeling 
of the protein, the protons on the protein were the 
methyl protons of threonines- The combination of these 
two experiments unambiguously identified cross-peak #2 as 
corresponding to atoms at the interface of the CL (NADH) 
site and SL site. Furthermore, perturbation by 
irradiation of the interface atoms on the other ligand, 
2,6-PDC, also produced NOEs to cross-peak #2 in the 
ternary complex, further confirming its assignment as 
corresponding to an interface atom. It should be noted 
that the additional NOEs in Figure 11 from the H8A and 
HI' A protons of AcNADH were to another threonine methyl 
that was in a part of the CL site, which was more remote 
from the interface region. 

D. NMR-based identification of proximal CL mimics and 
proximal atoms on CL mimics 

CL mimic protons were first assigned based on 
2D-C0SY data before binding the CL mimic to the DHPR 
protein, and also in the presence of protein. NMR 
experiments were performed as in the first paragraph of 
section C. 2D ^-"C HMQC data were collected at multiple 
concentrations of the CL mimic identified in section B. 
At the highest concentration of the CL mimic, a 2D-NOESY 
experiment was carried out with a mixing time of 200 msec 
(Figure 13) . The NOEs of interest were from the 
perturbed interface cross-peak identified in section C to 



any protons on the CL mimic being studied. NOEs from the 
interface atom identified as cross-peak #2 matched only 
the HI proton of the CL mimic TTE0001. 002. D2. This 
confirmed the assignment of the Hi proton as an atom 
proximal to the interface region. As shown, NOEs are 
also observed from the interface atom identified as 
cross-peak #2 to bound SL analog 2,6-PDC. Also, 
perturbation of the interface atoms on the 2,6-PDC 
produced an NOE to the HI proton of the CL mimic. 

Thus, proton HI represents the optimal region 
of the CL mimic TTE0001 . 002 . D2 to attach a linker for bi- 
ligand library expansion, so that diversity elements can 
be geometrically directed into the SL site. 

Each of the references and U.S. Patents cited 
above is hereby incorporated herein by reference. 

Although the present invention has been 
exemplified by the disclosed embodiments, those skilled 
in the art will readily appreciate that the specific 
examples are provided to illustrate, not to limit, the 
invention. Thus, while bacterial enzymes have been 
presented for purposes of illustration, the methods of 
the invention may be readily applied to identifying drug 
candidates effective against enzymes involved in other 
diseases where inhibition of bi-ligand enzymes would be 
advantageous, for example cancer, cardiovascular disease 
and other microbial and viral infections such as HIV. It 
should therefore be understood that various modifications 
can be made without departing from the spirit of the 
invention. Accordingly, the invention is limited only by 
the following claims. 



